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ABSTRACT 



A 3-D graphics chip includes independent internal DRAM 
buffers each having a wide bandwidth access bus for con- 
nection to a 3-D texture rendering drawing engine. The 3-D 
drawing engine takes advantage of a flexible embedded 
memory interface to reduce the traditional 3-D pipeline 
delay by a factor of 3. In a specific embodiment, each of 
three drawing processes — texture, Z, pixel — retrieves and 
stores information in a separate embedded drawing buffer 
via separate wide bandwidth access busses. Access to an 
external memory is provided via a separate external access 
bus. In another specific embodiment, the 3-D drawing 
engine accesses the embedded drawing buffers via read and 
write FIFO's to maximize the drawing process throughput. 

7 Claims^ 6 Drawing Sheets 
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3-D GRAPHICS CHIP WITH EMBEDDED and therefore a dramatic improvement in performance, the 

DRAM BUFFERS announced architectures appear to have shortcomings which 

will severely undercut the manufacturers' claims. 

The present patent application is a continuation of prior SUMMARY OF THE INVENTION 

applicationSer.No.09/^^^^^^^ 5 ^^^^^^^ ^ 

EMBE^^^^ the internal memory as multiple U^nJi 

hMBbUUbU UKAM uu^btK^i. memory banks, each having its own very wide access bus 

FIELD OF THE INVENTION memory controller. The present invention takes advan- 
tage of this internal memory organization by modifying a 

The invention relates to the field of computer graphics traditional 3-D drawing engine to take advantage of the 

devices and more specifically to specialized processors for memory architecture to increase the 3-D drawing process 

accelerating a color, 3-dimensional ("3-D") drawing process throughput by a factor of 3. This result is accomplished by 

used with laptop computers having high-end game and connecting each of three drawing processes which comprise 

multimedia capability. a traditional 3-D drawing process to a separate internal 

BACKGROUND OF THE INVENTION "'"'""^ '""^ reconfiguring the three drawing processes 

to reduce unnecessary delays. 

The acceleration of a color, texture rendering 3-D drawing In a specific embodiment, the internal memory is imple- 

process is well known in the field of computer graphics. In mented as independent memory banks of at least 1 MB each, 

the past such accelerators have relied primarily upon the use Each independent memory bank has its own access bus, 

of external 3-D drawing buffers for storage of 3-D process typically 128 bits wide. The 3-D drawing engine connects to 

control and 3-D drawing information. However, the use of the memories via read and write FIFO's of equivalent width, 

external drawing bufiFers has made it diflScuh to increase The modified 3-D drawing engine defines * concurrent' 3-D 

accelerator throughput. Yet continuing advances in com- drawing processes as opposed to a traditional 'pipelined' 

puter graphics capabilities have pushed a demand for higher architecture. 

bandwidth 3-D drawing processes, having greater ^ in another specific embodiment of a 3-D graphics chip 
throughput, to support high-end games and multimedia having embedded DRAM buffers, the invention includes a 
applications. matrix of programmable switches which independently con- 
This need for greater throughput has been a special nect each of several interrelated drawing processes to one of 
challenge to the designers of 3-D graphics accelerators for the independent internal buffers. The switch matrix is pro- 
use in high-end laptop computers and in some battery- grammable by a host computer and is useful for adjusting the 
operated hand-held devices. These special markets place a graphics accelerator to meet the needs of specific appUca- 
premium on small size and low operating power. The tions. The switch matrix permits any drawing engine to 
designers of these specialized chips face practical limitations communicate with any internal memory bank, once a host 
to solving their problems by increasing the number of I/O selection has been made. 

pins at the periphery of a chip in an effort to continue using in another specific embodiment, the invention includes an 

external drawing buffers. A typical external memory bus external bus permitting one or more drawing buffers to be 

now includes 64 lines which cannot be shared with other I/O located external to the graphics accelerator. A programmable 

signals. Yet many of these chips already have in excess of switch matrix is used to connect selected drawing processes 

200 I/O pins, thus making the addition of 100-200 pins ^ to either an internal buffer or to an external buffer via the 

impractical. For this reason, the industry has begun to move external bus, thus providing a simple form of drawing 

away from external memory and toward internal memory. process output to the external buffer. 

,«ooTx"l^' T"" '"^'"''''^ small Static-RAM gj^j^p deSCRIFHON OF THE DRAWING 
("SRAM ) cache memoncs m an effort to provide the higher 

throughput ("effective processing bandwidth"). A few 3-D 45 ^ further understanding of the objects, features and 

graphics accelerators have implemented larger internal advantages of the present invention, reference should be had 

SRAM buffers for storage of setup and intermediate drawing t^e following description of the preferred embodiment, 

information, but these are limited to desktop computers ^aken in conjunction with the accompanying drawing, in 

where size and power are not a major consideration. which like parts are given like reference numerals and 

The use of SRAM for internal storage has several draw- 50 

backs for accelerators intended for the high-end laptop FIG. 1 is a block diagram lUustratmg a 3-D graphics chip 

market: (1) if limited to internal cache memory only, the ^i^h embedded DRAM buffers, accordmg to one aspect of 

improvement in processing bandwidth is not significant; (2) present invention. 

SRAM requires much layout area, limiting the amount of FIG. 2 is a block diagram of a 3-D graphics chip with 

storage which can be provided, and hence the improvement 55 embedded DRAM buffers and having a wide bandwidth 

in processing bandwidth; and (3) SRAM is a power hungry access bus and an access priority engine, according to 

technology not lending itself to use in battery operated another aspect of the invention. 

equipment. These limitations prevent successful use of FIG. 3 is a block diagram which illustrates a 3-D graphics 

SRAM for large internal memories on the order of 2 MB or chip with dual embedded buffers, each having a wide 

more, which is a size required for the needed improvement 60 bandwidth access bus, and the chip also having an access 

in processing throughput. bus for connection to an external buffer. 

Recently, several manufacturers have proposed 3-D FIG. 4 is a block diagram of another embodiment of the 

graphics accelerators for use in the high-end laptop market 3-D graphics chip showing a single embedded buffer parti- 

which include dynamic-RAM ("DRAM") for internal buffer tioned into two drawing buffers. 

storage. DRAM has the advantages of lower power con- 65 FIG. 5 is a block diagram which illustrates a plurality of 

sumption and smaller layout area for a given memory size. programmable switches used to selectively connect read and 

Though these proposed devices promise 2 MB memories write FIFO's to one of two embedded DRAM buffers. 
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FIG. 6 is a block diagram which illustrates a 3-D graphics a Z-engine 26; a pixel-engine 28 and the mask 32; and a 

chip having a pair of independent embedded DRAM texture-engine 30. The Z-engine drawing process is related 

memories, a group of programmable switches controllable to the pixel-engine drawing process by the line 27 which 

by a host device for connecting portions of a 3-D drawing carries Z-mask information from the Z-cngine 26 to the 
engine to one or the other embedded memory, or to an 5 °iask 32. The texture-engine drawing proccK is related to 

external memory. pixel-engine drawing process by the line 31 which 

n' uiij- c ^ u-i.- Carries texture Information from the texturc engine 30 to the 

no. 7 .s a block diagram of a 3-D graphics chip having j^^, ^8 where the texture is blended with previous 

a pau of mdependcnt embedded DRAM memones for ^^^j iufomjation 

storing drawingengineoontrolinformation and 3-0 drawing ^ .^^^^^^^^ ^^^^^^ ^6 communicates 

mtormation. ^.^j^ Z-buflfer 34 via the read HFO 16 and the write FIFO 

DESCRIPTION OF THE PREFERRED ^ specific embodiment, all the FIFO's shown in FIG. 

EMBODIMENT ^ wide. In similar manner, the pixel engine 28 

and the mask 32 communicate with the display buffer 36 via 

With reference to FIG. 1, there is shown a block diagram the read FIFO 18 and the write FIFO 24. Finally, the texture 

of a 3-D graphics chip having embedded DRAM buffers engine 30 communicates with the texture buffer 38 via the 

according to one aspect of the present invention. The 3-D read FIFO 20. Aperson skiUed in the 3-D drawing engine art 

graphics chip is designated generally by the reference will recognize that the traditional 3-D drawing engine has 

numeral 10, and includes a 3-D drawing engine 12, embed- been modified to take advantage of the concurrent access of 

ded DRAM buffers 14, a plurality of read FIFO's 16, 18, 20, the three drawing buffers 34, 36, 38. 

and write FIFO's 22, 24. Hie 3-D drawing engine 12 ^ traditional 3-D drawing engine the drawing buffers 

mcludes a Z-engme 26, a pixel engine 28, a texmre engine ^^e typicaUy located in a single external memory which the 

30, and a mask 32. The embedded DRAM buffers 14 include ^-D drawing engine accesses via a single 64-bit wide 

a Z-buffer 34, a display buffer 36, and a texture buffer 38. ^^^^^ ^us. The need to access the three drawing buffers 

Typically, the graphics chip 10 is implemented on a single t^e single narrow bus necessitates the pipeline architec- 

substrate using a reduced voltage level CMOS technology, ^ure requiring 21 unit time intervals for priming the pipeline 

for example 3.3V CMOS. (in the example of drawing 10 pixels), and 3 unit time 

In a specific embodiment of the invention, the display intervals for processing each fiilly rendered color drawing 
buffer 36 is partitioned into a pair of buffers, commonly pixel. Some of the 21 unit time intervals are required for 
referred to as a front display buffer and a back display buffer. fetching drawing information from the external memory, 64 
The 3-D drawing engine 12 works with the contents of one bits at a time. The remaining unit time intervals are required 
of the two display buffers, while the contents of the other for processing the fetched drawing information to form 
display buffer is available for output to a display device. updated drawing information, and for writing the updated 
When the 3-D drawing engine completes its work with the information back to the external memory. This sequence 
contents of the one display buffer, it begins to work with the must be repeated for each 3-D drawing pixel, 
contents of the other display buffer. The display output is the present invention, however, the three drawing 
taken from the contents of the display buffer the 3-D processes concurrently fetch new drawing information, pro- 
drawing engine 12 has most recently completed. ^ess current drawing information, and store updated drawing 

The 3-D drawing engine 12 implements a modification of information in a continuous stream of 3-D drawing pixels, 
a traditional 3-D texture-blending drawing algorithm such as Tht result is that the delay through the drawing pipeline is 
disclosed in "Direct3D API Specification," by the Microsoft reduced to the minimum required to property interrelate the 
Corp., and the OPENGL^" specification. A traditional 3-D three drawing processes. This minimum is one unit time 
drawing engine accesses drawing information from an exter- interval, versus the 3 unit time intervals of the traditional 
nal memory via a 64-bit wide bus, and employs a pipelined 3-D drawing pipeline. So every unit time interval the 3-D 
architecture which imposes a delay on output of a fully 45 drawing engine 12 of the present invention fetches, pro- 
rendered color pixel. For example, if 10 pixels are to be cesses and updates a new 3-D drawing pixel, 
drawn, the traditional 3-D dravwng engine typically requires Part of the speedup is a result of working with 128 bits at 
21 unit time intervals to output a first fully rendered color a time versus 64 bits. Another part of the speedup is a result 
pixel. Once the pipeline is primed, a new fully rendered of using mdependent intemal memories versus a single 
color pixel is output each 3 unit time periods. jq external memory. And another part of the speedup is a result 

The present invention modifies the traditional pipeline to of reorganizing the relationships of the Z-engine 26, the 

shorten the pipeline delay by a factor of 3. But in addition, pixel engine 28, and the texture engine 30 to take advantage 

the read FIFO's which connect the 3-D drawing engine with of the concurrent nature of the memory access to the 

the embedded memories operate independently. Thus draw- embedded DRAM buffers 14. A person having an ordinary 
ing information which requires 21 unit time intervals to 55 skill in the art will appreciate that, if it were practical to do 

prime the pipeline in a traditional 3-D drawing engine, is so, external memory access bandwidth could be increased 

ready to output a first fully rendered color pixel in one unit by adding sufficient I/O pins to widen an external memory 

time interval and to output successive fully rendered color access bus. This expedient, however, will be appreciated to 

pixels each one unit time interval thereafter. Thus, the earlier be impractical given the current stale of the packaging and 
statement that the 3-D drawing engine 12 processes a fully go the interconnect arts. 

rendered color pixel at 3 times the throughput of the tradi- FIG. 2 is a block diagram of a 3-D graphics chip with 

tional engine. The traditional engine operates upon data embedded DRAM buffers and having a wide bandwidth 

fields that are 64 bits wide, while a specific embodiment of access bus and an access priority engine, according to 

the present invention operates upon data fields that are 128 another aspect of the invention. The graphics chip is desig- 
bits wide. 65 nated by the reference numeral 40, and is typically imple- 

As a result of the modification, the 3-D drawing engine mented on a single substrate using a reduced voltage level 

defines three concurrent and interrelated drawing processes: CMOS technology. 
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The 3-D graphics chip 40 includes a 3-D drawing engine 
42, embedded DRAM buffers 44, read FIFO's 46, 48, 50, 
write FIFO's 52, 54, an access priority engine 56, and a wide 
bandwidth access bus 58, 

The 3-D drawing engine 42 and the FIFO's 46-54 cor- 
respond to the 3-D drawing engine 12 and FIFO's 16-24 
shown in FIG. 1. The FIFO's 46-54 are connected to the 
embedded DRAM buffers 44 via the wide bandwidth access 
bus 58, and each FIFO is connected to the access priority 
engine 56. In a specific embodiment of the graphics chip 40, 
the width of the wide bandwidth access bus is 128 bits. 

The purpose of the access priority engine 56 is to deter- 
mine which FIFO's are requesting access to the embedded 
DRAM buffers via the wide bandwidth access bus 58, and 
to grant access to each requesting FIFO, one-at-a-time, such 
that each concurrent drawing process (FIG. 1) obtains 
needed buffer access. The access priority engine grants 
access requests in a dynamic manner such that no FIFO or 
group of FIFO's is permitted to 'hog' the wide bandwidth 
access bus 58. The access priority engine 56 maximizes the 
use of the available bus bandwidth by insuring two things: 
(1) that no FIFO or group of FIFO's can prevent any 
drawing process from gaining needed buffer access; and (2) 
that if the bus is free and at least one FIFO is requesting 
access, the at least one FIFO will be granted bus access. 
This,prooess results in a tendency for read FIFO's to fill and 
for write FIFO's to empty. 

It is important to keep in mind, however, that even though 
the FIFO's are competing for access and that the access, on 
a short term basis, is not always granted in the same 
sequence, the drawing information being read from the read 
FIFO's by the concurrent drawing processes, and stored into 
the write FIFO's by the drawing processes, occurs in an 
order determined by the interrelated drawing processes. The 
FIFO's and access priority engine permit the drawing pro- 
cesses to access the embedded buffers at a maximum rate 
determined by the bandwidth of the wide bandwidth access 
bus. 

Another specific embodiment of the invention is illus- 
trated in FIG. 3, a block diagram showing a 3-D graphics 
chip with dual embedded buffers, each having a wide 
bandwidth access bus, and the chip also having an access 
bus for connection to an external buffer. The graphics chip 
is designated by the reference numeral 60, and is typically 
implemented on a single substrate using a reduced voltage 
level CMOS technology. 

The 3-D graphics chip 60 includes a texture engine 62, a 
Z-engine 64, a pixel engine 66, read FIFO's 68, 70, write 
FIFO's 72, 74, 76, wide bandwidth access busses 78, 80, an 
external access bus 82, an embedded DRAM texture buffer 
84, and an embedded DRAM Z-buffer 86. A person having 
an ordinary level of skill in the art will appreciate that the 
texture engine 62 corresponds to the texture engine 30 of 
FIG. 1, while the Z-engine 64 corresponds to the Z-engine 
26 of FIG. 1, and the pixel engine 66 corresponds to the pixel 
engine 28 and the mask 32 of FIG. 1. It will also be 
appreciated that the three 3-D drawing engines 62-66 imple- 
ment concurrent, interrelated drawing processes as 
described above with respect to FlG's 1, 2. 

Further improvement in the drawing process throughput is 
obtained using the architecture illustrated in FIG. 3. The 
embedded DRAM buffer is implemented as separate 
memory banks 84, 86, each with its own wide bandwidth 
access bus 78, 80, respectively. Each access bus has its own 
access priority engine (56 of FIG. 2), In a specific embodi- 
ment of the invention, the wide bandwidth access bus 78 has 
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a width of 128 bits, while the width of bus 80 is also 128 bits, 
and the width of the external access bus is 64 bits. 

In the specific embodiment illustrated in FIG. 3, the 
texture buffer resides within the embedded DRAM memory 

5 bank 84. As shown in FIG. 1, the texture engine 30 does not 
write back to the texture buffer 38. Therefore in FIG. 3 only 
the read FIFO 68 accesses the embedded DRAM memory 
bank 84 via the wide bandwidth access bus 78. The Z-cngine 
64 of FIG. 3 accesses the Z-buffer in the embedded DRAM 

10 memory bank 86 via read FIFO 70, write FIFO 72, and the 
wide bandwidth access bus 80. The pixel engine 66 accesses 
a display buffer located in an external memory (not shown). 

As a result of the arrangement illustrated in FIG. 3, each 
drawing engine is able to access its respective drawing 
buffer without competition from another drawing engine. In 
addition, the location of the display buffer in an external 
memory provides a display output (recall that the display 
buffer is typically partitioned into * front' and 'back' buffers 
and that one of the buffers is always being used by the pixel 
engine while the alternate buffer is being used by a display 
process, and that the two buffers are interchanged as the 
pixel engine and the display process complete with the 
contents of their respective buffers). 
The consequence of the lack of competition for buffer 

^ access between the interrelated concurrent 3-D drawing 
processes is that 3-D drawing engine throughput is signifi- 
cantly increased. 

FIG. 4 is a block diagram which illustrates another 
specific embodiment of a 3-D graphics chip having embed- 
ded DRAM buffers in which the texture buffer and the 
Z-buffer reside within a single embedded DRAM while the 
display buffer is located in an external memory. 

The graphics chip is designated by the reference numeral 

35 90, and includes a texture engine 92, a Z-engine 94, a pixel 
engine 96, read FIFO's 98, 100, write FIFO's 102, 104, 106, 
a wide bandwidth access bus 108 and corresponding access 
priority engine (not shown), an external access bus 110 and 
corresponding access priority engine (not shown), a single 
embedded DRAM memory bank, designated generally by 
the reference numeral 112, including a texture buffer 114 and 
a Z-buffer 116. 

Though not illustrated in FIG. 4, a person having an 
ordinary level of skill in the art will appreciate that the wide 

45 bandwidth access bus 108 and the external access bus 110 
each include a corresponding priority access engine which 
serves the same function as that described with respect to the 
priority access engine 56 of FIG. 2. 

The effective drawing process throughput in the specific 

50 embodiment illustrated in FIG. 4 is not as high as the 
throughput for the embodiment illustrated in FIG. 3. The 
reason for the difference is readily apparent by examining 
the relationship between the texture engine 92 and the 
texture buffer 114 on the one hand and the Z-engine 94 and 

55 the 2^buffer 116 on the other hand. In FIG. 3 each 3-D 
drawing engine accesses its respective buffer via a separate 
access bus, while in FIG. 4 the two 3-D drawing engines 92, 
94 access their respective drawing buffers via the single 
wide bandwidth access bus 108. In the embodiment illus- 

60 trated in FIG. 3, the Z-engine 64 and the texture engine 62 
do not compete for access to their respective drawing 
buffers, while in FIG. 4 the two 3-D drawing engines 92, 94 
do compete. 

It should be pointed out that the 3-D drawing engines in 
65 each of FIGS. 1-4 are able to operate more rapidly than the 
effective bandwidths of their respective drawing buffer 
accesses. Thus, if the access bandwidth is increased, the total 
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graphics chip throughput will be increased. It is for that 
reason that going from the single bank of embedded drawing 
buffers 112 in FIG. 4 to the dual banks of embedded drawing 
buffers 84, 86 in FIG. 3 provides a significant increase in 
drawing process throughput. 

In another embodiment of the invention, a matrix of 
programmable switches permits any FIFO to be connected 
to any one of several wide bandwidth access busses, depend- 
ing upon the number of such busses in the embodiment. FIG. 
5 is a block diagram which illustrates a portion of a 3-D 
graphics chip designated generally by the reference numeral 
120. 

The graphics chip 120 includes a matrix of programmable 
switches 122, a write RFO 124, two read FIFO*s 126, 128, 
a first embedded DRAM 130, a second embedded DRAM 
132, a first wide bandwidth access bus 134 and correspond- 
ing first access priority engine 136, and a second wide 
bandwidth access bus 138 and corresponding second access 
priority engine 140. 

TTie switches of the matrix 122 are all independently 
programmable, enabling a connection to be established 
between any FIFO 124, 126, 128 and either wide bandwidth 
access bus 134, 138 and corresponding access priority 
engine 136, 140. Such programmable switching matrices are 
well known in the art. In a specific embodiment, the switches 
are programmable by a host device. 

The use of the switch matrix 122 introduces a significant 
flexibility into the 3-D graphics chip as it permits any 
drawing buffer to be located in either embedded DRAM 130, 
132 or, because each switch of the matrix 122 is individually 
programmable, to be located within a single embedded 
DRAM. A person having an ordinary level of skill in the art 
will appreciate that the embodiment illustrated in FIG. 5 is 
exemplary only and that the switch matrix can be applied in 
additional ways to increase the flexibility of the 3-D graph- 
ics chip 120. 

FIG. 6 is a block diagram which illustrates another 
specific embodiment of a 3-D graphics chip having embed- 
ded DRAM buffers. The graphics chip is designated by the 
reference numeral 150, and is typicaUy implemented on a 
single substrate using a reduced voltage level CMOS tech- 
nology. 

The 3-D graphics chip 150 includes a 3-D drawing engine 
152, a plurality of read FIFO's 154, a plurality of write 
FIFO's 156, a programmable switch matrix 158, two embed- 
ded DRAM memory banks 160, 162, two wide bandwidth 
access busses 164, 166, with corresponding priority access 
engines 170, 172, an external access bus 174 with corre- 
sponding priority access engine 176, and a host access port 
178. 

Though not explicitly shown, the 3-D drawing engine 152 
includes a texture engine (62 of FIG. 3), a Z-cngine (64 of 
FIG. 3), and a pixel engine (66 of FIG. 3). The read FIFO's 
154 include one read FIFO for each of the three specialized 
drawing processes (texture, Z, pixel) corresponding to read 
FIFO's 68, 70, 74 of HG. 3. The write FIFO's 156 include 
one write FIFO for each of the Z and pixel drawing 
processes corresponding to write FIFO's 72, 76 of FIG. 3. 

In a specific embodiment of the 3-D graphics chip shown 
in FIG. 6, each wide bandwidth access bus 164, 166 is 128 
bits wide. Each embedded DRAM memory bank 160, 162 
provides 2 MB of storage. Each read FIFO and each write 
FIFO is 128 bits wide by 64 bits deep. The external access 
bus 174 is 64 bits wide. When the programmable switch 
matrix 158 is configured to place a drawing buffer in an 
external memory, the FIFO (or FIFO's) for that drawing 
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buffer are programmed to be 64 bits wide in order to match 
the width of the external access bus to the width of the 
FIFO's. 

In a specific application, the switch matrix 158 is config- 
5 ured (via the host access port 178) to locate the texture buffer 
in the embedded DRAM memory 160, to locate the Z-buffer 
in the embedded DRAM memory 162, and to locate the 
display buffer in an external memory via connection through 
the external access bus 174. 

In another specific application, the switch matrix 158 is 
configured to locate both the texture buffer and the Z-buffer 
in the embedded DRAM memory 160, and to locate the 
display buffer in the embedded DRAM memory 162. 

In yet another specific embodiment, the programmable 
switch matrix 158 of the 3-D graphics chip 150 is configured 
to locate drawing control information in a specific embedded 
memory bank. An example of this embodiment is illustrated 
in FIG. 7 which is a block diagram of a 3-D graphics chip 
designated by the reference numeral 180 (the programmable 
switch matrix is not shown). 

The 3*D graphics chip 180 includes a geometry engine 
182, an anti-aliasing polygon engine 184, a first embedded 
DRAM memory bank 186 defining a display list buffer, a 

25 second embedded DRAM memory bank defining an anti- 
aliasing buffer, a first wide bandwidth access bus 190 (and 
access priority engine, not shown, but see FIGS. 2, 6), a 
second wide bandwidth access bus 192 (and access priority 
engine, not shown), read FIFO's 194, 196, and write FIFO 

30 198. 

The display list buffer 186 stores geometric attributes of 
3-D graphics. The display list buffer 186 is initially loaded 
by the host (see host port 178 of FIG. 6) and the contents of 
the display list buffer are locally available, so that it is 

35 unnecessary to request these from the host each time they are 
needed. In actual use, the geometry engine 182 operates 
concurrently with the 3-D drawing engine (152 of FIG. €). 
Thus a portion of the embedded memories 186, 188 is 
devoted to a Z-buffer (86 of FIG. 3) and a texture buffer (84 

40 of FIG. 3) while the portion shown is used for a display list 
buffer. 

The anti-aliasing buffer stores 3-D drawing information, 
and the anti-aliasing polygon engine 184 is part of a 3-D 
drawing engine (not previously shown). Thus FIG. 7 illus- 
45 trates the embedded DRAM memories used to store both 
drawing engine control information (the display list buffer 
186), and 3-D drawing information (the anti-aliasing buffer 
188). 

While the invention has been described in relation to the 
embodiments shown in the accompanying drawing figures, 
other embodiments, alternatives and modifications will be 
apparent to those skilled in the art. It is intended that the 
specification be exemplary only, and that the true scope and 
spirit of the invention be indicated by the following claims. 

What is claimed is: 

1. A 3-D graphics chip having embedded DRAM buffers, 
the chip comprising: 

an embedded DRAM partitioned into drawing buffers; 
50 a 3-D drawing engine configured to concurrently access 
the embedded drawing buffers for providing rendered 
3-D drawing information, the 3-D drawing engine 
implementing concurrent drawing processes, the draw- 
ing processes being interrelated in a predetermined 
55 manner; 

output means permitting the graphics chip to output the 
rendered 3-D drawing information; 
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a plurality of independent read FIFO's and write FIFO's 
used by the concurrent drawing processes to access the 
embedded buffers; and 

further including an access priority engine, communicat* 
ing with all FIFO's needing access to a given drawing 
buffer, and granting access priority in a dynamic man- 
ner to each such FIFO, one-at-a-lime, thereby insuring 
that each concurrent drawing process obtains needed 
buffer access. 

2. The 3-D graphics chip as set forth in claim 1, further 
including a wide bandwidth access bus for connecting a 
FIFO having the granted access priority with the embedded 
DRAM. 

3. The 3-D graphics chip as set forth in claim 1, further 
including each FIFO having its own address register and 
offset register permitting each FIFO to access the entire 
address space of the embedded DRAM. 

4. A 3-D graphics chip having embedded DRAM buffers, 
the chip comprising: 

an embedded DRAM partitioned into drawing buffers; 

a 3-D drawing engine configured to concurrently access 
the embedded drawing buffers for providing rendered 
3-D drawing information, the 3-D drawing engine 
implementing concurrent drawing processes, the draw- 
ing processes being interrelated in a predetermined 
manner; 
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output means permitting the graphics chip to output the 

rendered 3-D drawing information; 
a plurality of independent read FIFO's and write FIFO's 
used by the concurrent drawing processes to access the 
5 embedded buffers; and 

wherein the embedded DRAM being arranged in at least 
two independent memory banks, and each memory 
bank having a separate wide bandwidth access bus, and 
each access bus having a corresponding access priority 
IQ engine communicating with the FIFO's. 

5. The 3-D graphics chip as set forth in claim 4, further 
including a plurality of programmable switches, and each 
FIFO being connectable to a selected access bus and a 
corresponding access priority engine via the programmable 

J J switches. 

6. The 3-D graphics chip as set forth in claim 5, further 
including at least two drawing buffers located within a single 
memory bank, and the FIFO's for the corresponding draw- 
ing processes being connected to the access bus and the 
priority engine for said memory bank via the programmable 
switches. 

7. The 3-D graphics chip as set forth in claim 5, further 
including means permitting a host device to program said 
switches for configuring the drawing processes via FIFO's 

2^ to drawing buffers in selected memory banks. 

* :lf * * * 
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